A phonetic concatenative approach of labial coarticulation
Abstract
Predicting the effects of labial coarticulation is an important step in developing an artificial talking head. This paper describes a concatenation approach that uses sigmoids to represent the evolution of labial parameters. The labial parameters considered are lip aperture, protrusion, stretching and jaw aperture. A first formal algorithm determines the relevant transitions, i.e. those corresponding to phonemes that impose constraints on one of the labial parameters. The relevant transitions are then either retrieved or interpolated from a set of reference sigmoids trained on a speaker-specific corpus. This labial corpus is made up of isolated vowels, CV, VCV and VCCV sequences, and 100 sentences. A final stage improves the overall syntagmatic consistency of the concatenation.
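As a rough illustration of what representing a labial parameter by concatenated sigmoids could look like, the sketch below models each transition as a logistic curve between two target values and evaluates a piecewise trajectory. The abstract gives no formula, so the logistic form, the names `SigmoidTransition` and `trajectory`, the parameterisation (start value, end value, midpoint, slope) and all numeric values are assumptions made for illustration only, not the authors' method.

```python
import bisect
import math
from dataclasses import dataclass


@dataclass
class SigmoidTransition:
    """One transition of a labial parameter (hypothetical parameterisation)."""
    start_value: float  # parameter value before the transition
    end_value: float    # parameter value after the transition
    midpoint: float     # time (s) at which the transition is half complete
    slope: float        # steepness (1/s); larger means a faster transition

    def value_at(self, t: float) -> float:
        # Standard logistic function, rescaled between the two target values.
        s = 1.0 / (1.0 + math.exp(-self.slope * (t - self.midpoint)))
        return self.start_value + (self.end_value - self.start_value) * s


def trajectory(transitions, boundaries, t):
    """Evaluate a concatenation of transitions at time t.

    boundaries holds len(transitions) + 1 increasing times; transition i
    is used on the interval [boundaries[i], boundaries[i + 1]).
    Times outside the covered range fall back to the nearest transition.
    """
    i = bisect.bisect_right(boundaries, t) - 1
    i = min(max(i, 0), len(transitions) - 1)
    return transitions[i].value_at(t)


# Illustrative values only: lip aperture opening and then partly closing.
lip_aperture = [
    SigmoidTransition(start_value=2.0, end_value=10.0, midpoint=0.1, slope=60.0),
    SigmoidTransition(start_value=10.0, end_value=4.0, midpoint=0.3, slope=60.0),
]
segment_boundaries = [0.0, 0.2, 0.4]

samples = [trajectory(lip_aperture, segment_boundaries, k * 0.05) for k in range(9)]
```

In this sketch the end value of one transition matches the start value of the next, which keeps the concatenated curve nearly continuous at the segment boundary. A real system would also need the retrieval or interpolation of reference sigmoids and the final consistency stage that the abstract mentions, neither of which is modelled here.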
Similar resources
Inter speaker variability of labial coarticulation with the view of developing a formal coarticulation model for French
Explaining the effects of labial coarticulation is a difficult problem that gave rise to many studies and models. Most of the time small corpora were exploited to design these models. In this paper we describe the realization and exploitation of a corpus with ten speakers. This corpus enables the most invariant labial features (protrusion, stretching and lip opening) to be established. Then we ...
The interaction of gradient and categorical processes of long-distance vowel-to-vowel assimilation in Kazan Tatar
Conklin, Jenna T. M.A., Purdue University, May 2015. The Interaction of Gradient and Categorical Processes of Long-Distance Vowel-to-Vowel Assimilation in Kazan Tatar. Major Professor: Dr. Mary Niepokuj. Vowel harmony and vowel-to-vowel coarticulation are long-distance assimilatory processes wherein certain vowels trigger systematic changes in adjacent vowels; harmony effects phonological chang...
Comparison between two predicting methods of labial coarticulation
The construction of a highly intelligible talking head involving relevant lip gestures is especially important for hearing impaired people. This requires realistic rendering of lip and jaw movements and thus relevant modeling of lip coarticulation. This paper presents the comparison between the Cohen & Massaro prediction algorithm and our concatenation plus completion strategy guided by phoneti...
Stages and method of preparing syllabic and diphone speech databases for a Persian text-to-speech system
Speech databases are part of concatenative text-to-speech synthesis systems. The phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two speech databases for Persian, one syllable-based and one diphone-based, and examines how they were developed, their specifications, and their relative advantages. ...
Corpus-based Mandarin speech synthesis with contextual syllabic units based on phonetic properties
This paper describes an improved concatenative synthesis module for a Chinese text-to-speech system [1]. The concatenated segments are selected online from a designed speech corpus that is precisely segmented with an improved version of HMM models. The selection criteria are the prosodic and contextual similarities between the units and the desired targets from the previous module of the TTS sy...